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CHARACTERIZATION OF THE GSK-3p PROTEIN 
AND METHODS OF USE THEREOF 

FIELD OF THE INVENTION 
5 This invention relates to the three-dimensional structure of human glycogen 

synthase Idnase 3 (GSK3), to crystals of a construct of GSK3, to methods for 
forming crystals of the GSK3 construct, to methods for determining the crystal 
structure of the GSK3 construct, and to methods for using the three-dimensional 
structure of GSK3 to identify possible therapeutic compounds for the treatment of 
10 various disease conditions mediated by GSK3 activity. 

BACKGROUND OF THE INVENTION 
Glycogen synthase kinase 3 (GSK3) is a serine/threonine Idnase for which 
two isoforms, a and p, have been identified. Woodgett, Trends Biochem. Set, 
16:177-81 (1991). Both GSK3 isoforms are constitutively active in resting, cells. 
15 GSK3 was originally identified as a kinase that inhibits glycogen synthase by direct 
phosphorylation. Upon insulin activation, GSK3 is inactivated, thereby allowing the 
activation of glycogen synthase and possibly other insulin-dependent events, such as 
glucose transport. Subsequently, it has been shown that GSK3 activity is also 
inactivated by other gro^vth factors that, like insulin, signal through i-eceptor tyrosine 
20 kinases (RTKs). Examples of such signaling molecules include IGF-1 and EGF. 
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Saito et at., Biochem. J,, 303:27-31 (1994); Welsh et al., Biochem. J. 294:625-29 
(1993); and Cross et al., Biochem, J., 303:21-26 (1994). 

Agents that inhibit GSK3 activity are useful in the treatment of disorders that 
. are mediated by GSK3 activity. In addition, inhibition of GSK3 mimics the 
5 activation of growth factor signaling pathways and consequently GSK3 inhibitors are 
useful in the treatment of diseases in which such pathways are insufficiently active. 
Examples of diseases that can be treated with GSK3 inhibitors include diabetes, 
Alzheimer's disease, CNS disorders such as bipolar " disorder, and immune 
potentiation-related conditions, among others. 
10 Because inhibitors of GSK3 are useful in the treatment of many diseases, the 

identification of new^ inhibitors of GSK3 would be highly desirable. The present 
invention provides a method for identifying possible therapeutic compounds for the 
treatment of various disease conditions mediated by GSK3 activity. The method of 
the present invention utilizes the three-dimensional structure of a GSK3 construct 
15 that contains the protein's catalytic domain to identify possible therapeutic 
compounds and to optimize the structure of lead therapeutic compoimds. 

SUMMARY OF THE INVENTION 
In accordance with the present invention, the three-dimensional stmcture of a 
construct of human glycogen synthase kinase 3 (GSK3) is provided. 
20 In one aspect, the invention provides crystals of a construct of hxaman 

glycogen synthase kinase 3-p (GSK3-p) contauaing the protein's catalytic kinase 
domain. 

In another aspect of the invention, a method for crystallizing the protein 
construct to provide a GSK3 crystal sufficient for structure determmation is 
25 provided. 

In a further aspect, the three-dimensional structure of the GSK3 construct is 
provided. 

In yet another aspect, a method is provided for using the GSK3 construct's 
three-dimensional structure for the identiJScation of possible therapeutic conipounds 
30 in the treatment of various disease conditions mediated by GSK3 activity. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The foregoing aspects and many of the attendant advantages of this invention 
will become more readily appreciated by reference to the following detailed 
description, when talcen in conjunction with the accompanying drawings, wherein: 
35 FIGURE 1 is an illustration of the structure of the GSK3-p construct; 
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FIGURE 2 is an illustration of the structure of the GSK3-p construct active 

site; 

FIGURE 3 is a flow diagram of a representative method of the invention 
using the three-dimensional structure of the GSK3-P . qonstract for identifying 
5 possible therapeutic compounds for mediating GSK3-p activity; and 

FIGURE 4 is a flow diagram of a representative method of the invention 
using the three-dimensional structure of the GSK3-P construct for identifying 
possible therapeutic compounds for mediating GSK3-P activity. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
10 In accordance with the present invention, crystals of a protein construct of 

human glycogen synthase kinase 3-p (GSK3-P) containing the protein's catalytic 
kinase domain, methods for crystallizing the protein construct, the three-dimensional 
structure of the protein construct, and methods for using the three-dimensional 
structure for the identification of possible therapeutic compounds in the treatment of 
15 various disease conditions mediated by GSK3-P activity are provided. 

The GSK3-P Protein Construct: Expression, Purification, and Crystallization 
In one aspect, the invention provides a composition that includes a GSK3-P 
construct that contains the protein's catalytic kinase domain. The construct includes 
at least residues 37-384 of human GSK3-P and lacks the 36 amino acids at the 
20 proteia's C-terminus. The composition is a crystalline form sufficient for stmcture 
determination by diffraction studies by X-ray. 

It Avill be appreciated that GSK3 protein constructs other than the construct 
described herein, for example, active mutants or variants thereof, can provide three- 
dimensional structural information useful in identifying possible therapeutic 
25 compoxmds in the treatment of various disease conditions mediated by GSK3 
activity. 

Construct Sequence . The construct sequence, SEQ ID NO: 1, is provided 
below. The asterisk indicates the first residue that is seen in tii^ crystal structure. The 
following constmct and additional useful constructs and their preparation are 
30 described in co-pending U.S. Patent Application Serial No. 60/221,242, filed July 27, 
2000, the disclosure of which is incorporated herein by reference in its entirety and 
for all purposes. 

N^terminus: MEYMPMEGGGGSK 

*VTTVVATPGQGPDRPQEVSYTDTKVIGNGSFGVVYQAKLCDSGELV 
3 5 AIKKVLQDKRPKNRELQIMRKLDHCNIVRLRYFF YSSGEKKDEVYLNLVLD Y 
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VPETVYRVARHYSRAKQTLPVIYVKLYMYQLFRSLAYIHSFGICHRDIKPQN 
LLLDPDTAYLKLCDFGSAKQLVRGEPNVSYICSRYYRAPELIFGATDYTSSID 
VWSAGCVLAELLLGQPIFPGDSGVDQLVEIIKVLGTPTREQIREMNPNYTEFK 
FPQIKAHPWTKVFRPRTPPEAIALCSRLLEYTPTARLTPLEACAHSFFDELRDP 
5 NVKLPNGRDTPALFNFTTQELSSNPPLATILIPPHARI: C-terminus (SEQ ID NO; 1) 

Construct Purification . The GSK3-P protein construct was extracted from 
SF-9 cells infected with a baculovirus carrying GSK3-P 580 cDNA construct. The 
GSK3-P protein construct was purified to apparent homogeneity using S-Fractogel, 
Phenyl-650 M, and Glu-tag affinity chromatographies. The purified protein was then 
10 concentrated for crystallization. Purification of the construct is described in 
Example 1. 

Construct Crystallization . Protein crystals can be formed from solutions of 
the GSK3 construct by, for example, the hanging drop technique. A representative 
method for forming suitable crystals of the GSK3 construct suitable for structure 

15 determination is described in Example 2. 

It will be appreciated that various crystallization methods including, for 
example, microcrystallization methods can be utilized to obtain three-dimensional 
structural information useful in identifying possible therapeutic compounds in the 
treatment of various disease conditions mediated by GSK3 activity. 

20 The GSK3-I3 Protein Construct Structure 

In another aspect of the invention, the three-dimensional structure of the 
GSK3 protein construct is provided. Amino acid sequence data and atomic 
coordinates derived from X-ray diffraction data were used to determine the 
construct's three-dimensional structure. The construct's atomic coordinates were 

25 calculated from an electron density map produced from the combination of X-ray 
diffraction and phase data. 

With the GSK3 construct available in crystalline form suitable for structural 
determination, the crystal structure can be obtained by a variety of techniques. In a 
representative method, diffraction patterns were obtained using an X-ray image plate 

30 device. Phase data was then obtained by a combination of molecular replacement 
and cross-crystal averaging techniques. Electron density maps were then constructed 
and the stmcture solved and molecule built. The resulting structure was refined and 
the structure validated. The ultimate result was an atomic model of the GSK3 
construct. A representative method for obtaining the GSK3 crystal structure is 

3 5 described in Example 3 . 
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It will be appreciated that the GSK3 structure can be solved by a variety of 
methods. 

The statistics for collecting the crystallographic data are summarized in 
Table 1. 

5 Table 1. Data and Model Statistics for Structure Solution. 
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The three-dimensional structure of the GSK3-P construct provided as a 
tabulation of atomic coordinates is given in Table 2. In the table, "OH2'' and "GOL" 
10 refer to structural water and glycerol molecules, and "TER" refers to the terminus of 
a peptide chain. 

Table 2. GSK3-(3 Construct Atomic Coordinates . 
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15. 


831 


1. 


00 


31. 


62 


25 


ATOM 


5673 


0H2 


TIP 


83 


25. 


,823 


76. 


,105 


11. 


243 


1. 


00 


25, 


69 




ATOM 


5674 


OH2 


TIP 


84 


75. 


,453 


38. 


.146 


30, 


,507 


1. 


,00 


25. 


25 




ATOM 


5675 


0H2 


TIP 


85 


56. 


.540 


32. 


.057 


24. 


,557 


1, 


,00 


40. 


,19 




ATOM 


5676 


OH2 


TIP 


86 


68. 


. 921 


53. 


.407 


27. 


,143 


1. 


.00 


32. 


,24 




ATOM 


5677 


0H2 


TIP 


87 


48 . 


.823 


99, 


.292 


13. 


,844 


1. 


,00 


48. 


,14 


30 


ATOM 


5678 


0H2 


TIP 


88 


66, 


. 603 


27 . 


. 914 


24. 


.871 


1. 


.00 


24 . 


,07 




ATOM 


5679 


0H2 


TIP 


89 


72. 


.553 


25, 


. 024 


4 , 


.480 


1. 


.00 


37 . 


.79 




ATOM 


5680 


0H2 


TIP 


90 


74 . 


.805 


44 , 


. 624 


26. 


. 080 


1. 


.00 


24 . 


.08 




ATOM 


5681 


0H2 


TIP 


91 


82, 


.314 


60 


.516 


10, 


.339 


1, 


.00 


28 . 


.44 




ATOM 


5682 


0H2 


TIP 


92 


41, 


.009 


72 


.787 


-2, 


. 680 


1, 


.00 


49. 


. 99 


35 


ATOM 


5683 


0H2 


TIP 


93 


75 


.218 


36 


.521 


28, 


.025 


1, 


.00 


21. 


.73 
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ATOM 


5684 


OH 2 


TIP 


94 


58, 


.786 


33. 


.514 


-1. 


584 


1. 


00 


46. 


. 12 




ATOM 


5685 


•0H2 


TIP 


95 


59 


.443 


56, 


.186 


13. 


092 


1. 


00 


24 , 


.57 




ATOM 


5686 


OH2 


TIP . 


96 


44, 


.571 


79. 


.740 


17. 


158 


1. 


00 


33. 


.72 




ATOM 


5687 


OH2 


TIP 


97 


36, 


.543 


97. 


. 946 


. 22. 


228 


1. 


00 


20. 


.11 


5 


ATOM 


5688 


0H2 


TIP 


98 


42, 


.102 


91, 


.284 


• 4. 


898 


1. 


00 


22, 


.27 




ATOM 


5689 


0H2 


TIP 


99 


83, 


.993 


26, 


.068 


16. 


812 


1. 


00 


39, 


.48 




ATOM 


5690 


0H2 


TIP 


100 


71, 


.425 


22, 


.395 


31. 


294 


1. 


00 


42. 


.78 




ATOM 


5691 


0H2 


TIP 


101 


41 


. 521 


86 


.237 


20. 


453 


1. 


00 


36, 


.59 




ATOM 


5692 


0H2 


TIP 


102 


77 


.290 


21, 


.465 


18. 


797 


1. 


00 


39, 


.54 


10 


ATOM 


, 5693 


OH2 


TIP 


103 


34 


.574 


71 


. 613 


8. 


514 


1. 


00 


40 


. 98 




ATOM 


5694 


0H2 


TIP 


104 


80 


.216 


58 


. 145 


30. 


951 


1. 


00 


31 


. 65 




ATOM 


5695 


0H2 


TIP 


105 


51 


. 953 


83 


. 113 


-1. 


215 


1, 


00 


34 


.53 




ATOM 


5696 


OH2 


TIP 


106 


78 


. 065 


52 


.209 


26. 


693 


1. 


00 


31 


.02 




ATOM 


5697 


0H2 


TIP 


107 


68 


. 221 


20 


. 544 


13. 


056 


1. 


00 


29 


.84 


15 


ATOM 


5698 


0H2 


TIP 


108 


9 


. 340 


73 


. 178 


-1 . 


320 


1. 


00 


27 


. 34 




ATOM 


5699 


OH 2 


TIP 


109 


82 


.865 


28 


. 096 


7. 


563 


1 . 


00 


24 


. 08 




A-TOM 


5700 


0H2 


TIP 


110 


51 


. 681 


22 


. 615 


23. 


853 


1 . 


00 


42 


.57 




ATOM 


5701 


OH2 


TIP 


111 


80 


. 577 


21 


. 841 


24 . 


237 


1. 


00 


38 


.89 




ATOM 


5702 


OH2 


TIP 


112 


29 


. 043 


102 


. 944 


3. 


861 


1 . 


00 


36 


. 35 


20 


ATOM 


5703 


0H2 


TIP 


113 


34 


.227 


97 


.988 


29, 


010 


1. 


00 


29 


.76 




ATOM 


5704 


0H2 


TIP 


114 


40 


.735 


90 


.064 


18. 


442 


1. 


00 


46 


.42 




ATOM 


5705 


OH2 


TIP 


115 


25 


.295 


100 


.974 


13. 


456 


1. 


00 


25 


.31 




ATOM 


5706 


OH2 


TIP 


116 


75 


. 441 


52 


.065 


2. 


449 


1. 


•00 


19 


.42 




ATOM 


5707 


0H2 


TIP 


117 


74 


.070 


56 


.553 


-0. 


631 


1. 


00 


47 


.18 


25 


ATOM 


5708 


OH2 


TIP 


118 


65 


.566 


64 


.590 


9. 


062 


1. 


00 


44 


.16 




ATOM 


5709 


0H2 


TIP 


119 


60 


.666 


43 


.201 


3. 


854 


1. 


00 


34 


.57 




ATOM 


5710 


0H2 


TIP 


120 


76 


.120 


45 


.123 


29. 


,617 


1. 


,00 


36 


.08 




ATOM 


5711 


OH2 


TIP 


121 


54 


.142 


51 


.739 


14. 


.688 


1. 


,00 


31 


.52 




ATOM 


5712 


0H2 


TIP 


122 


39 


.719 


105 


.971 


10. 


,102 


1, 


,00 


27 


.01 


30 


ATOM 


5713 


OH2 


TIP 


123 


85 


'.171 


44 


.498 


7, 


.479 


1, 


.00 


31 


.29 




ATOM 


5714 


OH2 


TIP 


, 124 


86 


.210 


29 


.970 


-11. 


. 184 


1. 


.00 


29 


. 65 




ATOM 


5715 


OH2 


TIP 


125 


72 


. 634 


31 


. 654 


-0. 


.249 


1. 


.00 


25 


.24 




ATOM 


5716 


0H2 


TIP 


126 


16 


. 147 


70 


.198 


22. 


.206 


1 . 


. 00 


28 


.88 




ATOM 


5717 


0H2 


TIP 


127 


37 


.231 


69 


. 031 


17 . 


. 117 


1. 


.00 


39 


. 61 


35 


ATOM 


5718 


0H2 


TIP 


128 


35 


.202 


98 


.368 


26. 


.457 


1. 


.00 


23 


.25 
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ATOM 


5719 


OH2 


TIP 


129 


101. 


. 610 


39. 


762 


-20. 


072 


1. 


00 


30. 


.45 


ATOM 


5720 


OH2 


TIP 


130 


62. 


. 699 


33. 


014 


-2. 


726 


1. 


00 


42. 


. 68 


ATOM 


5721 


OH2 


TIP 


131 


66. 


.836 


50. 


243 


30. 


662 


1. 


00 


39. 


.68 


ATOM 


5722 


0H2 


TIP 


132 


5. 


.358 


93. 


633 


. 5. 


849 


1. 


00 


48. 


.29 


ATOM 


5723 


0H2 


TIP 


133 ■ 


87. 


.786 


24 . 


130 


-16. 


104 


1. 


00 


39. 


.99 


ATOM 


5724 


OH2 


TIP 


134 . 


42. 


.301 


75. 


221 


8. 


757 


1. 


00 


42. 


.10 


ATOM 


5725 


OH2 


TIP 


135 


78, 


.036 


44 . 


723 


-10. 


662 


1. 


00 


37. 


.65 


ATOM 


5726 


OH2 


TIP 


136 


38 , 


.073 


90. 


012 


21. 


232 


1. 


00 


»23. 


.10 


ATOM 


5727 


0H2 


TIP 


137 


24 < 


.581 


92. 


415 


-2. 


442 


1. 


00 


37. 


.05 


ATOM 


5728 


0H2 


TIP 


138 


12, 


.396 


106. 


041 


18. 


053 


1. 


00 


43, 


.25 


ATOM 


5729 


0H2 


TIP 


139 


77 , 


. 977 


34 . 


593 


3. 


036 


1. 


00 


25, 


.44 


ATOM 


5730 


0H2 


TIP 


140 


40 


.297 


92. 


891 


-1. 


662 


1. 


00 


40 


.79 


ATOM 


5731 


0H2 


TIP 


141 


35 


.031 


85. 


843 


-0. 


290 


1. 


00 


30 


. 75 


ATOM 


5732 


OH2 


TIP 


142 


78 


. 177 


49. 


267 


4 . 


406 


1. 


00 


44 


. 08 


ATOM 


5733 


0H2 


TIP . 


143 


44 


. 684 


85, 


363 


-13. 


653 


1, 


00 


42 


. 96 


ATOM 


5734 


0H2 


TIP 


144 


16 


. 523 


68 . 


716 


10. 


379 


1. 


00 


44 


.38 


ATOM 


5735 


0H2 


TIP 


145 


37 


. 171 


84. 


228 


20. 


730 


1. 


00 


37 


.73 


ATOM 


5736 


0H2 


TIP 


146 


84 


. 109 


34 . 


734 


25. 


222 


1. 


00 


28 


.75 


ATOM 


5737 


0H2 


TIP 


147 


13 


.573 


65, 


829 


5. 


292 


1. 


00 


41 


.32 


ATOM 


5738 


0H2 


TIP 


148 


87 


.775 


51. 


477 


19. 


308 


1. 


00 


33- 


.04 


ATOM 


5739 


0H2 


TIP 


149 


61 


.055 


46. 


851 


29. 


860 


1. 


00 


38 


.86 


ATOM 


5740 


0H2 


TIP 


150 


30 


.709 


93. 


775 


30. 


547 


1. 


00 


35 


.96 


ATOM 


5741 


0H2 


TIP 


151 


16 


.687 


95. 


,444 


4 . 


127 


1. 


00 


34 


.28 


ATOM 


5742 


0H2 


TIP 


152 


20 


.727 


95. 


654 


3. 


238 


1. 


00 


38 


.20 


ATOM 


5743 


0H2 


TIP 


153 


77 


.522 


26. 


,578 


-16. 


,506 


1, 


,00 


34 


.79 


ATOM 


5744 


0H2 


TIP 


. 154 


17 


.542 


77. 


,986 


26. 


,146 


1. 


,00 


40 


.07 


ATOM 


5745 


0H2 


TIP 


155 


63 


.161 


38. 


,672 


-5. 


,820 


1. 


00 


54 


,97 


ATOM 


5746 


0H2 


TIP 


156 


74 


.192 


34 . 


.524 


35, 


.857 


1. 


,00 


57 


.51 


ATOM 


5747 


0H2 


TIP 


157 


21 


.552 


99. 


.456 


10. 


, 491 


1. 


,00 


27 


.88 


ATOM 


5748 


0H2 


TIP 


158 


83 


.085 


24 . 


.564 


22. 


. 892 


1. 


.00 


31 


.49 


ATOM 


5749 


0H2 


TIP 


159 


47 


.798 


39, 


.263 


-2, 


.076 


1. 


.00 


54 


. 69 


ATOM 


5750 


0H2 


TIP 


160 


88 


. 099 


51, 


.183 


23, 


. 984 


1. 


.00 


35 


.12 


ATOM 


5751 


0H2 


TIP 


161 


24 


. 423 


92, 


.769 


30, 


.535 


1. 


.00 


38 


.45 


ATOM 


5752 


0H2 


TIP 


162 


63 


. 166 


20 


. 190 


11, 


.461 


1, 


.00 


46 


,78 


ATOM 


5753 


0H2 


TIP 


163 


59 


.098 


44 


.329 


5, 


.827 


1, 


.00 


33 


.01 



wo 02/24893 
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ATOM 


5754 


OH2 


TIP 


164 


13, 


,507 


107. 


912 


20. 


781 


1. 


00 


35. 


72 




ATOM 


5755 


OH2 


TIP 


165 


83, 


, 412 


61. 


086 


22. 


636 


1. 


00 


35. 


01 




ATOM 


5756 


0H2 


TIP 


166 


49. 


, 112 


73. 


913 


6. 


534 


1. 


00 


66. 


27 




ATOM 


5757 


0H2 


TIP 


167 


91. 


,394 


32. 


212 


. 10. 


337 


1. 


00 


50. 


39 


5 


ATOM 


5758 


0H2 


TIP 


168 


6, 


, 449 


94. 


287 


26. 


888 


1. 


00 


47. 


63 




ATOM 


5759 


0H2 


TIP 


169 


76. 


.464 


68. 


219 


23. 


721 


1. 


00 


53. 


,79 




ATOM 


5760 


OH2 


TIP 


170 


22, 


.808 


74. 


862 


19. 


059 


1. 


00 


33. 


,21 




ATOM 


5761 


0H2 


TIP 


171 


48. 


.317 


80. 


149 


7. 


342 


1. 


00 


27. 


.26 




ATOM 


5762 


0H2 


TIP 


172 


41. 


. 162 


85. 


237 


-16. 


033 


1. 


00 


39. 


,98 


10 


ATOM 


5763 


0H2 


TIP 


173 


51, 


.407 


31. 


394 


15. 


649 


1. 


00 


42, 


,59 




ATOM 


5764 


OH 2 


TIP 


17 4 


22. 


. 955 


60. 


844 


22. 


421 


, 1. 


go 


38. 


.82 




ATOM 


5765 


0H2 


TIP 


175 


28 . 


. 394 


98 . 


779 


1. 


827 


1. 


00 


41. 


. 53 




ATOM 


5766 


0H2 


TIP 


176 


69, 


. 684 


65. 


594 


24 . 


524 


1. 


00 


44 . 


. 96 




ATOM 


5767 


0H2 


TIP 


177 


43 


.200 


93. 


280 


3. 


459 


1. 


00 


34 . 


. 13 


15 


ATOM 


5768 


0H2 


TIP 


178 


17 , 


.756 


90. 


119 


27 . 


355 


1 . 


00 


45. 


.97 




ATOM 


5769 


0H2 


TIP 


17 9 


105, 


.475 


35. 


722 


3. 


208 


1 . 


00 


46. 


.41 




ATOM 


5770 


OH 2 


TIP 


180 


18 


. 511 


69. 


070 


13. 


131 


1. 


00 


44 , 


.83 




ATOM 


5771 


OH2 


TIP 


181 


34 


.777 


83. 


,786 


27. 


054" 


1 . 


00 


49, 


.09 




ATOM 


5772 


0H2 


TIP 


182 


76 


.901 


25. 


642 


31. 


034 


1. 


00 


47, 


.31 


20 


ATOM 


5773 


0H2 


TIP 


183 


48 


. 876 


35. 


.97 9 


8. 


482 


1. 


00 


32, 


.37 




ATOM 


5774 


0H2 


TIP 


184 


25 


.637 


99. 


, 6'45 


26. 


739 


1. 


00 


45, 


.36 




ATOM 


5775 


0H2 


TIP 


185 


93 


.103 


36. 


,239 


0. 


414 


1. 


00 


33, 


.77 




ATOM 


5776 


0H2 


TIP 


186 


71 


.600 


55. 


.767 


30. 


268 


1. 


00 


32 


.79 




ATOM 


5777' 


0H2 


TIP 


187 


15 


.541 


76. 


.607 


2. 


456 


1. 


,00 


41 


. 98 


25 


ATOM 


5778 


0H2 


TIP 


188 


41 


.719 


74. 


.503 


14. 


650 


1. 


,00 


51 


.88 




ATOM 


5779 


0H2 


TIP 


189 


75 


.164 


69. 


.545 


21. 


068 


1. 


.00 


52 


.07 




ATOM 


5780 


OH2 


TIP 


190 


74 


.872 


39. 


.797 


-2, 


637 . 


. 1. 


,00 


41 


.24 




ATOM 


5781 


0H2 


TIP 


191 


22 


. 684 


97, 


.520 


3. 


,087 


1. 


.00 


37 


.30 




ATOM 


5782 


0H2 


TIP 


192 


25 


.467 


107, 


. 654 


1. 


,749 


1. 


.00 


41 


.11 


30 


ATOM 


5783 


0H2 


TIP 


193 


69 


.282 


51 


. 934 


2 . 


. 697 


1. 


.00 


41 


. 69 




ATOM 


5784 


0H2 


TIP 


194 


38 


. 994 


107 


.268 


27 . 


.347 


1 , 


.00 


37 


.35 




ATOM 


5785 


0H2 


TIP 


195 


63 


.092 


51 


.837 


28, 


.741 


1, 


.00 


38 


. 92 




ATOM 


5786 


0H2 


TIP 


196 


11 


.487 


77 


.282 


0. 


. 147 


1, 


.00 


36 


.21 




ATOM 


5787 


0H2 


TIP 


197 


36 


. 150 


94 


.786 


-15, 


.531 


1, 


.00 


39 


. 60 


35 


ATOM 


5788 


0H2 


TIP 


198 


82 


.360 


44 


.024 


21, 


.173 


1 


.00 


36 


.55 
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ATOM 


5789 


0H2 


TIP 


199 


35. 


092 


94 . 


097 


23, 


903 


1. 


00 


32. 


,06 




ATOM 


5790 


OH2 


TIP 


200 


63. 


021 ■ 


70. 


590 


15. 


903 


1. 


00 


56. 


.60 




ATOM 


5791 


OH2 


TIP 


201 


49. 


283 


43. 


418 


9- 


849 


1. 


00 


47. 


,58 




ATOM 


5792 


0H2 


TIP 


202 


36. 


906 


100. 


577 


. 21. 


840 


1. 


00 


40. 


,78 


5 


ATOM 


5793 


0H2 


TIP 


203 


76. 


,551 


63. 


953 


25, 


972 


1. 


00 


35, 


.46 




ATOM 


5794 


0H2 


TIP 


204 


87 . 


.842 


55. 


120 


11. 


824 


1. 


00 


29. 


. 94 




ATOM 


5795 


OH 2 


TIP 


205 


36. 


937 


66. 


014 


-8. 


244 


1. 


00 


41. 


. 61 




ATOM 


5796 


0H2 


TIP 


206 


82. 


.257 


40. 


,382 


21. 


175 


" 1. 


00 


36. 


.08 




ATOM 


5797 


0H2 


TIP 


207 


75. 


.418 


44. 


,375 


-1. 


993 


1. 


00 


40. 


. 61 


10 


ATOM 


5798 


OH 2 


TIP 


208 


8, 


,151 


69. 


166 


16. 


710 


1. 


00 


44 . 


. 90 




ATOM 


5799 


0H2 


TIP 


209 


83. 


,811 


57 . 


, 022 


19. 


559 


1. 


00 


23. 


.75 




ATOM 


5800 


OH2 


TIP 


210 


89. 


.732 


38 . 


, 502 


10. 


624 


1. 


00 


45, 


.38 




ATOM 


5801 


OH2 


TIP 


211 


57 . 


. 995 


53. 


. 9.00 


13. 


742 


1 . 


00 


45, 


. 45 




ATOM 


5802 


OH2 


TIP 


212 


41 . 


.221 


100, 


.360 


-7 . 


718 


1, 


00 


49, 


.06 


15 


ATOM 


5803 


OH2 


TIP 


213 


72. 


.092 


37. 


. 629 


32 . 


475 


1. 


00 


61, 


.51 




ATOM 


5804 


OH 2 


TIP 


214 


37 , 


.776 


74 . 


.115 


23. 


513 


1 . 


00 


53, 


. 60 




ATOM 


5805 


OH2 


TIP 


215 


95. 


.633 


49. 


.033 


19. 


239 


1. 


00 


39 


.53 




ATOM 


5806 


OH2 


TIP 


216 


15, 


.800 


72, 


. 932 


22. 


618 


1. 


00 
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The three-dimensional structure of the GSK3-P construct based on the 
derived crystal structure is schematically illustrated in FIGURE 1. The construct 
includes N-terminal and C-terminal domains with the active site formed between the 
5 two domains. The N-terminal domain includes a P-barrel. The active site region 
includes the ATP binding site, the magnesium binding/catalytic base site, and 
substrate binding site. 

The three-dimensional structure of the GSK3-|3 construct*s active site 
(including the catalytic site and substrate binding site) based on the derived crystal 
10 structure is schematically illustrated in FIGURE 2. The active site includes Prol36 
and Phe67 among other amino acid residues. 

Structural information of the apoprotein active site can provide a basis for the 
rational design of ligands leading to therapeutic compounds effective in the treatment 
of various disease conditions mediated by GSK3-P activity. Thus, the structural 
1 5 information obtained from the cry stallographic data can be used to develop a ligand 
profile and for the rational design of drugs for mediating GSK3-P activity as 
described below. 

GSK3 Structural Representation . As noted above, in one aspect, the 
invention provides a method for identifying possible therapeutic compounds in the 

20 treatment of various disease conditions mediated by GSK3-P activity. The method 
involves the use of a three-dimensional structural representation of the GSK 
construct. The three-dimensional structural representation may be a representation of 
(a) the complete GSK construct, (b) a fragment of GSK3 that includes the GSK 
construct, or (c) a fragment of the GSK construct that includes the amino acids that 

25 interact with ligands lhat can mediate GSK3 activity. 

The structural representation is preferably based on or derived from the 
atomic coordinates as set out in Table 2, which represents the structure of the 
complete GSK construct. Suitable structural representations include 
three-dimensional models and molecular surfaces derived from these atomic 

30 cooi'dinates. The coordinates in Table 2 include structural water and glycerol 
molecules. These will vary, and may even be absent, in other models derived 
structurally (they are resolution and space group dependent). These solvent 
molecules will vary from crystal to crystal. 

Variants of the atomic coordinates noted in Table 2 can also be used for the 

35 invention, such as variants in which the RMS deviation of the x, y, and z coordinates 
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for all heavy (i.e., not hydrogen) atoms are less than about 2,5A, for example, less 
than about 2 A, preferably less than about lA, more preferably less than about 0.5 A, 
or most preferably less titian about 0.1 A) compared with the atomic coordinates noted 
in Table 2. Coordinate transformations that retain the . three-dimensional spatial 
5 relationships of atoms can also be used to give suitable variants. 

The atomic coordinates provided herein can also be used as the basis of 
models of further protein structures. For example, a homology model could be based 
on the GSK construct structure. The coordinates can also be used in the solution or 
refinement of further crystal structures of GSK3, such as co-crystal structures with 
10 newligands. 

GSK3 Structural Representation Storage Medium . The atomic coordinates of 
the GSK construct can be stored on a medium for subsequent use with a 
computational device, such as a computer (e,g., supercomputer, mainframe, 
minicomputer, or microprocessor). Typically, the coordinates are stored on a 

15 medium useful to hold large amounts of data, such as magnetic or optical media (e.g., 
floppy disks, hard disks, compact disks, magneto-optical media ("floptical" disks, or 
magnetic tape) or electronic media (e.g., random-access memory (RAM), or read- 
only memory (ROM). The storage mediiim can be local to the computer, or can be 
remote (e.g., a networked storage medium, including the Internet). The choice of 

20 computer, storage medium, networking, and other devices or techniques will be 
familiar to those of skill in the stmctural/computational chemistry arts. 

The invention also provides a computer-readable medium for a computer, 
which contains atomic coordinates and/or a three-dimensional structural 
representation of the GSK construct. The atomic coordinates are preferably those 

25 noted in Table 2 or variants thereof. Any suitable computer can be used in the 
present invention. 

GSK3-B Ligand Profile Development . As noted above, the structural 
information obtained from the crystallographic data can be used to develop a ligand 
profile useful for the rational design of compounds for mediating GSK3-P activity. 
30 A ligand profile can be developed by taking into account the stmctural information 
obtained as described above for the apoprotein. The ligand profile can be further 
developed and refined with the determination of additional structures of protein with 
boimd ligands. The ultimately developed ligand profile identifies possible 
therapeutic compounds for mediating GSK3-P activity. 



wo 02/24893 



PCT/USOl/29549 



-178- 



The ligand profile can be primarily based on a shape interaction between the 
ligand and the protein ligand binding site. The evaluation of the shape interaction 
can include consideration of the ligand's conformational properties, ranking ligands 
based on their ability to achieve low energy conformations compatible with the 
5 ligand binding site. The shape interaction can also seek to maximize enthalpic 
interactions between the ligand and the binding site. 

The process of developing a ligand profile can vary widely. For example, the 
profile can be developed by visual inspection of active site structures by experts. 
Such an inspection can include the consideration of the binding site and ligand 

10 structures and compound database searching. The development of the profile can 
also consider biological data and structure activity relationships (SAR) as well the 
consideration of known ligand binding interaction with other similar proteins. 

In any event, the ligand profile is developed by considering ligand binding 
interactions including primary and secondary interactions and results in defining the 

15 pharmacophore. The term "pharmacophore" refers to a collection of chemical 
features and tliree-dimensional constraints that represent specific characteristics 
responsible for a ligand's activity. The pharmacophore includes surface-accessible 
features, hydrogen bond donors and acceptors, charged/ionizable groups, and/or 
hydrophobic patches, among other features. 

20 In addition to the process for ligand profile development noted above, other 

structure-based drug design techniques can be applied to the structural representation 
of the GSK3 constmct in order to identify compounds that interact with GSK3 to 
mediate GSK3 activity. A variety of suitable techniques are available to one of 
ordinary skill in the art. 

25 Software packages for implementing molecular modeling techniques for use 

in structure-based drug design include SYBYL (available from ^ Tripos Inc., 
http://www/tripos.com); AMBER (available from Oxford Molecular, 
http://www/oxmol.co.uk/); CERIUS^ (available from Molecular Simulations Inc., 
http://www/msn.com/); INSIGHT II (available from Molecular Simulations Inc., 

30 http://www/msn.com/); CATALYST (available from Molecular Simulations Inc., 
http://www/msn.com/); QUANTA (available from Molecular Simulations Inc., 
http://www/msn.com/); HYPERCHEM (available from Hypercube Inc., 
http://www/hyper.com/); FIRST DISCOVERY (available from Schrodinger Inc., 
http://www/schrodinger,com), MOE (available from Chemical Computing Group, 
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lit1p://www/chemcomp.com), and CHEMSITE (available from Pyramid Learning, 
http://www/chemsite.org/),. among others. 

The modeling software can be used to determine GSK3 binding surfaces and 
to reveal features such as van der Waals contacts, electrostatic interactions, and/or 
5 hydrogen bonding opportunities. These binding surfaces can be used to model 
docking of ligands with GSK3, to arrive at pharmacophore hypotheses, and to design 
• possible therapeutic compounds de novo, 
GSK3-P Ligand Virtual Screening 

The three-dimensional structure of the apoprotein, and the structure of the 

10 protein's active site in particular, allows for the determination of the fit of compounds 
into the active site. Utilizing a fast docking program,* individual compounds from, 
for example, a compound database, can be evaluated for active site binding. The fit 
of a particular compound can be evaluated and scored. Setting a score threshold can 
then provides a family of compoxmds as a solution to the virtual screen. 

15 At the first level, the virtual screen takes into account the three-dimensional 

structure of the apoprotein's active site. At the second level, the virtual screen 
considers the ligand profile and can utilize information obtained from the 
determination of the structure of protein with bound ligand. A virtual screen is 
possible even if there is no structural information on a bound ligand. 

20 Information gained from the virtual screen can be considered to further 

develop the ligand profile. Alternatively, where the results of the virtual screen 
indicate a promising compound, the compoimd can be obtained and screened for the 
relevant biological activity. 

Docking . Docking refers to a process in which two or more molecules are 

25 aligned based on energy considerations. Docking aligns the three-dimensional 
structures of two or more molecules to predict the conformation of a complex formed 
from the molecules (see, e.g., Blaney& Dixon (1993) Perspectives^ in Drug 
Discovery and Design 1:301). In the practice of the present invention, molecules are 
docked with the GSK3 construct structure to assess their ability to interact with 

30 GSK3. 

Docking can be accomplished by either geometric matching of the ligand and 
its receptor or by minimizing the energy of interaction. Geometric matching 
algorithms are preferred because of their relative speed. 

Suitable docking algorithms include DOCK (Kuntz et al (1982) 1 Mol Biol 
35 161:269-288, available from UCSF), the prototypical program for structure-based 
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drug design; AUTODOCK (Goodsell & Olson (1990) Proteins: Structure, Function 
and Genetics 8:195-202 and available from Oxford Molecxilar, 
http://www/oxmoLco.uk/), which docks ligands in a flexible manner to receptors 
using grid-based Monte Carlo simulated annealing. The flexible nature of the 
5 AUTODOCK procedure helps to avoid bias (e.g., in orientation and conformation of 
the ligand in the active site) introduced by the user searcher (Meyer et al (1995) 
Persp, Drug Disc, 3:168-95) because, while the starting conformation in a rigid 
docking is normally biased towards a minimum energy conformation of the ligand, 
the binding conformation may be of relatively high conformational energy (Nicklaus 

10 et al. (1995) Bioorganic & Medicinal Chemistry 3:41 1). 

Other suitable docking algorithms include MOE — DOCK (available from 
Chemical Computing Group Inc., http://www/chemcomp.com), in which a simulated 
annealing search algoritlim is used to flexibly dock ligands and a grid-based energy 
evaluation is used to score docked conformations; FLExX (available from Tripos 

15 Inc.,.http://www/tripos.com), which docks conformationally :g.exible ligands into a 
binding site using an incremental construction algorithm that builds the ligand in the 
site, and scores docked conformations based on the strength of ligand-receptor 
interactions; GOLD (Jones et al (1997) J, Mol Biol 267:727-748), a genetic 
algorithm for flexible ligand docking, with full ligand and partial protein flexibility, 

20 and in which energy fimctions are partly based on conformation and non-bonded 
contact information; AFFINITY (available from Molecular Simulations Inc., 
http://www/msn.com/), which uses a two step process to dock ligands: first, initial 
placements of the ligand within the receptor are made using a Monte Carlo-type 
procedure to search both conformational and Cartesian space; and second, a 

25 simulated annealing phase optimizes the location of each ligand placement, during 
this phase, AFFINITY holds the "bulk" of the receptor (atoms not in the binding site) 
rigid, while the binding site atoms and ligand atoms are movable; C^ LigandFit 
(available from Molecular Simulations Inc., http://www/msn.com/), which uses the 
energy of the ligand-receptor complex to automatically find best binding modes and 

30 stochastic conformation search tecliniques, with the best results from the 
conformational sampling retained. A grid method is used to evaluate non-bonded 
interactions between the rigid receptor and the flexible ligand atoms. DOCKIT 
(available from Metaphorics LLC) uses distance geometry for fast flexible ligand 
docking. GLIDE (available from Schrodinger Inc.) uses a pre-computed energy grid 

35 and an efficiently primed systematic search for flexible docking. 
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Preferably, the docking algorithm is used in a high-throughput mode, in 
which members of large structural libraries of potential ligands are screened against 
the receptor structure (Martin (1992) J, Med. Chem\ 35:2145-54). 

Smtable stmctural libraries include the ACD (Available Chemical Directory, 
5 ■ form MDL Inc.), AsInEx, Bionet, ComGenex, the Derwent World Drug Index 
(WDI), the Contact Service Company database, LaboTest, ChemBridge Express 
Pick, ChemStar, BioByteMasterFile, Orion, SALOR, TRIAD, ILIAD, the National 
Cancer Institute database (NCI), and the Aldrich, Fluka, Sigman and Mabridge 
catalogs. These are commercially available (e.g., the HTS Chemicals collection from 

10 Oxford Molecular, or the LeadQuest™ files from Tripos). 

Defining the Pharmacophore . As noted above, a pharmacophore can be 
defined for the GSK3 construct that includes stuface-accessible features, hydrogen 
bond donors and acceptors, charged/ionizable groups, and/or hydrophobic patches, 
among other features. These features can be weighted depending on their relative 

15 importance in conferring activit}^ (see, e.g., Computer-Assisted Lead Finding and 
Optimization (eds. Testra & Folkers, 1997). . 

Pharmacophores can be determined using software such as CATALYST 
(including HypoGen or HipHop, available firom Molecular Simulations Inc., 
http://www/msn.com/), CERIUS^, or constructed by hand from a known 

20 conformation of a lead compound. The pharmacophore can be used to screen 
structural libraries, using a program such as CATALYST. The CLIX 
program (Davie & Lawrence (1992) Proteins 12:31-41) can also be used, which 
searches for orientations . of candidate molecules in structural databases that yield 
maximum spatial coincidence with chemical groups which interact with the receptor. 

25 The DISCO program (available from Tripos) uses a method of clique detection to 
identify common pharmacophoric features in each structure, produce optimally 
aligned structures, and extract the key features of the pharmacophore. The GASP 
program (available from Tripos) uses a genetic algorithm to automatically find 
pharmacophores with conformational flexibility. 

30 de novo Compound Design . The binding surface or pharmacophore of the 

GSK3 construct can be used to map favorable interaction positions for functional 
groups (e.g., protons, hydroxyl groups, amine groups, acidic groups, hydrophobic 
groups and/or divalent cations) or small molecule fragments. Compoxmds can then 
be designed de novo in which the relevant functional groups are located in the correct 

3 5 spatial relationship to interact with GSK3 . 
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Once ftmctional groups or small molecule fragments which can interact with 
specific sites in the binding surface of GSK3 have been identified, they can be linlced 
in a single compound using either bridging fragments with the correct size and 
geometry or frameworks which can support the ftmctional groups at favorable 
5 orientations, thereby providing a compound according to the invention. While 
linking of ftmctional groups in this way can be done manually, perhaps with the help 
of software such as QUANTA or SYBYL, automated or semi-automated de novo 
design approaches can also be used. 

Suitable de novo design software includes MCDLNG (Gehlhaar et ah (1995) 

10 J. Med, Chem, 38:466-72), which fills a receptor binding site with a close-packed 
array of generic atoms and uses a Monte Carlo procedure to randomly vary atom 
types, positions, bonding arrangements and other properties; MCSS/HOOK (Caflish 
et al (1993) J. Med, Chem. 36:2142-67; Eisen et al (1994) Proteins: Sir. FuncL 
Genet. 19:199-221; available from Moleculai' Simulations Inc., 

15 http:/Awvw/msnxom), which links multiple functional groups with molecular 
templates taken from a database; LUDI (Bohm (1992) J, Comp. Aided Molec. Design 
6:61-78, available from Molecular Simulations Inc., http://www/msn.com), which 
computes the points of interaction that would ideally be ftilfilled by a iigand, places 
fragments in the binding site based on their ability to interact with the receptor, and 

20 then connects them to produce a ligand; GROW (Moon and Howe (1991) Proteins: 
Str. Fund. Genet. 11:314-328), which starts with an initial "seed" fragment (placed 
manually or automatically) and grows the ligand outwards; SPROUT (available from 
http://chem.leeds.ac.uk/ICAMS/SPROUT.html), which includes molecules to 
identify favorable hydrogen bonding and hydrophobic regions wiliiin a binding 

25 pocket (HIPPO module), selects functional groups and positions them at target sites 
to form starting fragments for structure generation (EleFanT), generates skeletons 
that satisfy the steric constraints of the binding pocket by growing spacer fragments 
onto the start fragments and then connecting the resulting part skeletons (SPIDeR), 
substitutes hetero atoms into the skeletons to generate molecules with the 

30 electrostatic properties that are complementary to those of the receptor site 
(MARABOU), aad the solutions can be clustered and scored using the ALLigaTOR 
module; LEAPFROG (available from Tripos Inc., http://www/tripos.com), which 
evaluates ligands by making small stepwise structural changes and rapidly evaluating 
the binding energy of the new compound, keeps or discards changes based on the 

35 altered binding energy, and evolves structures to increase the interaction energy with 
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the receptor; GROUPBUILD (Rorstein et al (1993) J, Med, Chem. 36:1700), which 
uses a library of common organic templates and a complete empirical force field 
description of the non-bonding interactions between a ligand and receptor to 
construct ligands that have chemically reasonable structure and have steric and 
5 electrostatic properties complimentary to the receptor binding site; CAVEAT (Lauri 
and Bartlett (1994) Comp, Aided Mol Design 8:51-66), which designs linking imits 
to constrain acyclic molecules; and RASSE (Lai (1996) J, Chem, Inf, Comput Set 
36:1187-1194). 

GSK3-B Ligands 

10 Most lead compounds that initiate structure-based design cycles are identified 

by high-throughput screening. As a result of high throughput screening and the 
ligand profile and virtual screening described above, ligands are identified having the 
requisite conformational energies to assume a suitable shape and bind with the 
protein's active site. In addition to having low conformational energy and spatial 

15 compatibility with the apopi^otein active site, the identified ligands are preferably 
synthetically accessible. The identified ligands can then be obtained (e.g., 
commercially obtained or sjmthesized) and screened for biological activity. The 
identified ligands can also be co-crystallized with the protein construct and the 
three-dimensional structure determined for the protein with bound ligand. The 

20 information obtained from structure of the protein with boimd ligand can then be 
used to further develop the ligand profile as described above. 

Suitable GSK3-|3 biological screening methods for evaluating ligand 
biological activity are known and include, for example, those noted in U.S. patent 
application Serial No. 60/1 93, 043 , filed March 29, 2000, and expressly incorporated 

25 herein be reference in its entirety. 

Method for Rational Drug Discovery Using GSK3 Crystal Structures 
In another aspect, the invention provides a method for using a GSK3 crystal 
structure, specifically the three-dimensional structure of the GSK3 constructs active 
site, to design ligands for binding to and mediating the activity of GSK3-p. 

30 In one embodiment, the method is an iterative structure-based method for 

therapeutic compound design. A representative method is depicted by the flow 
diagram shown in FIGURE 3. Referring to FIGURE 3, the crystal structures of the 
apoprotein and the protein with bound ligand are determined in steps 102 and 104, 
respectively. From the structural information obtained from steps 102 and 104, a 

35 ligand profile is developed in step 106. A ligand profile can also be developed 
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directly from the crystal structure of the apoprotein. Using the resulting profile, new 
ligands can be designed and/or obtained, screened for biological activity, and/or 
co-crystallized with the protein in step 108, or alternatively, the ligand profile can be 
used in a virtual screen in step 110. If the ligand obtained from, the developed profile 
5 is co-crystallized, the structure of the co-crystal is determined in step 104 and the 
resulting structural information is used to further develop the ligand profile in 
step 106. If the ligand profile is used in a virtual screen in step 110, the virtual 
screen is either successful and identifies one or more ligands that can be obtained, 
screened, and/or co-crystallized in step 108. If the virtual screen is unsuccessful in 

10 identifying a suitable ligand, the ligand profile is further developed in step 106. 

Lead compounds can be identified from biological screening of ligands 
developed by the ligand profile. A representative method for identifying a lead 
compound is depicted by the flow diagram shown FIGURE 4. Referring to 
FIGURE 4, the crystal structures of the apoprotein and the protein with bound ligand 

15 are determined in steps 202 and 204, respectively. From the structural information 
obtained from steps 202 and 204, a ligand profile is developed in step 206. A ligand 
profile can also be developed directly from the crystal structure of the apoprotein. 
From the resulting profile, a new ligand can be designed and/or obtained in step 208, 
and either screened for biological activity in step 210 and/or co-crystallized with the 

20 protein in step 212. If the biological screen is successful, a lead compound is 
identified in step 214. In a subsequent iteration, the lead compound can be co- 
crystallized in step 212 and iterations continued until a new drug carididate is 
identified. If the biological screen is not successful, that information can be used to 
further develop the ligand profile in step 206. If the ligand is co-crystallized, the co- 

25 crystal structure can be determined in step 204 and the structural information used in 
further developing the ligand profile in step 206. 

Altematively, the results of the ligand profile can be used in a virtual screen 
in step 216. If the virtual screen is successful and identifies one or niore ligands, the 
ligand can be obtained in step 208 and screened in step 210 to determine its 

30 biological activity and whether or not a lead compound has been identified. The 
ligand obtained in step 208 can also be co-crystallized in step 212 and its structure 
determined and the resulting information used to further develop the ligand profile. 
If the virtual screen is unsuccessful in identifying a suitable ligand, the ligand profile 
is further developed in step 206. 
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GSK3 Ligands and Their Uses 

The method of the invention identifies Ugands that can interact with GSK3. 
These compounds can be designed de novo^ can be known compounds, or can be 
based on known compounds. The compounds can be useful pharmaceuticals 
5 themselves, or can be prototypes that can be used for further pharmaceutical 
refinement (i.e.; lead compounds) in order to improve binding affinity or other 
pharmacologically important features (e.g., bio-availability, toxicology, metabolism, 
pharmacokinetics). 

Accordingly, in another aspect, the invention provides (l)a compoxmd 
10 identified using the method of the invention; (2) a compound identified ixsing the 
method of the invention for use as a pharmaceutical; (3) the use of a compoxmd 
identified using the method of the invention in the manufacture of a medicament for 
mediating GSK3 • activity; and (4) a method of treating a patient afflicted with a 
condition mediated by GSK3 activity that includes administering an amount of a 
1 5 compound identified using the method of the invention that is effective to mediate 
GSK3 activity. 

These compounds preferably interact with GSK3 with a binding constraint in 
the micromolar or, more preferably, nanomolar range or stronger. 

As well as being useftil compoimds individually, ligands identified by the 
20 structure-based design techniques can also be used to suggest libraries of compounds 
for traditional in vitro or in vivo screening meliiods. Important pharmaceutical motifs 
in the ligands can be identified and mimicked in compound libraries (e.g., 
combinatorial libraries) for screening for GSK3 -binding activity. 

The foregoing and other aspects of the invention may be better understood in 
25 coimection with the following representative examples. 

EXAMPLES 
Example 1 
GSK3-B Construct Purification 
In this example, the purification of the GSK3-p protein construct is described. 
30 The construct was extracted firom SF-9 cells infected with a baculovirus carrying 
GSK3-P 580 cDNA construct and purified to apparent homogeneity using 
S-Fractogel, Phenyl-650 M, and Glu-tag affinity chromatographies as described 
below. 

Extraction . Cell paste from 20L fermentation of infected SF-9 cells was 
35 washed 100 mL PBS (10 mM NaPi, pH7.5, 150 mM NaCl) and then resuspended 
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with 300 mL of Buffer H (20 mM Tris, pH7.5, 1 mM tungstate, 1 mM arsenate, 
50 mM DTT, 10 [ag/mL leupeptin, 1 jxg/mL pepstatin A, 10% glycerol, 0.35% octyl 
glycoside, 1 mM Mg2+). Cells were homogenized in a 100-mL Douncer (20 strokes 
with pestelB). The combined homogenate was centrifuged in a Ti45 rotor at 
5 40,000 rpm for 35 minutes to remove cell debris and nuclei. The supernatant from 
the centrifugation were carefully decanted and filtered through 0.45 \i filter. 

S-Fractogel Chromatoearaphv . 175 mL S-fractogel (EM Science, Cat #1 8882) 
was packed into 5 cm x 8.9 cm colunrn and equilibrated with 5 column volumes of 
Buffer A (20 mM Tris, Ph7.5, 10% glycerol). Prior to loading the filtered 

10 supernatant, one column volume of Buffer A containing 50 mM DTT was passed 
over the equilibrated column. The filtrate from the previous step was then loaded at 
20 mL/min onto the column. The column was washed with 3 column volumes of 
Buffer A containing 50 mM DTT and 2 column volumes of Buffer A and then eluted 
with a linear gradient from 0 to 1 M NaCl in Buffer A over 20 column volumes. The 

15 eluant was fractionated into 20 mL fractions. Fractions containing GSK3 were 
detected by Western Blot using anti-GSK antibody (Santa Cruz Biotech, 
Cat# SC-7291). The Western-Blot positive fractions were pooled and mixed with 
equal volume of Buffer M (20 mM Tris, pH7.5, 10% glycerol, 3.1 M NaCl) and 
filtered through a 0.45 |a filter. The filtrate was used for Phenyl-650 

20 M chromatography. 

Phenvl-650 M Cliromatographv . 37.5 mL Phenyl-650 M (Tosohass, 
' Cat #014943) was packed into a 2.2 x 10 cm colunrn and equilibrated with 5 column 
volumes of Buffer C (20 mM Tris, pH 7.5, 10% glycerol, 1.6 M NaCl). Filtrate from 
S-fractogel step was loaded onto the colunrn at 7.5 mL/min. After the loading was 

25 completed, the column was washed with 5 column volumes of Buffer C and eluted 
with linear gradient fi:om 0% to 100% Buffer A (20 mM Tris, pH 7.5, 10% glycerol) 
over 20 column volumes. Fractions were collected at 15mL each and GSK 
containing firactions were detected by Westem Blot using anti-GSK antibody. The 
Western positive fractions were pooled and loaded onto a Glu-tag antibody affinity 

30 column. 

Glu-tag Affinity Chromatography . 50 mg of Glu-tag aatibody was 
immobilized onto 28 mL of Affi-Gel 10 (BioRAD, Cat #153-6046) and the packed 
into 2,2 X 6.5 cm column. The column was equilibrated with 5 column volumes of 
Buffer D (20 mM Tris, pH 7.5, 20% glycerol, 0.3 M NaCl, 0.2% octylglucoside) and 
35 the fraction pool from Phenyl-650 M step was loaded at 2.8 mL/min. After the 
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loaded was completed, the colxomn was wash with 5 column volumes of Buffer D aad 
then eluted with 100 mL Glu-tag peptide (100 M-g/mL) in Buffer D and fractionated 
into 4 mL fractions. GSK containing fractions were detected with SDS-PAGE aad 
Coomassie Blue staining. These fractions were pooled, concentrated, and diafiltered 
5 into Buffer D to approximately 4.8 mg/mL in an Amicon concentrator using a 10 k 
MWCO YMIO membrane. The concentrated material was then submitted for 
crystallization. ' 

Example 2 
GSK3-P Construct Crystallization 

10 In this example, the crystallization of the GSK3-p protein construct is 

described. Crystallization protocol utilizes GSK3P at a concentration of 4.8 mg/ml 
and in a solution containing IX TBS buffer, 0.3 M NaCl , 20% glycerol, 0.2 % octyl 
glucopyranoside, 5 mM DTT,. The crystals were grown by the vapor diffusion using 
the hanging drop method. Briefly, each well of a 24-well linbro culture plate was 

15 filled with 0.5 mL of a solution containing 10% polyethylene glycol 6000 (PEG 
6000), 5%) 2-methyl-2,4-pentanediol (MPD), 0.1 M HEPES, pH 7.5. 3 uL of protein 
solution was then mixed with 3 uL of well solution in a drop on a glass cover slip. 
The cover slip was then placed over the reservoir of the well. The plate was then 
stored at 4°C. Crystals appeared in 50-90% of the wells in 3-30 days. 

20 Representative GSK3 crystals are highly polymorphic and exhibit variance in 

both the space-group and unit cell dimensions, often showing space group transitions 
during compounds soaking and in response to subtle changes in conditions. Three 
space groups have been identified in GSK3 crystals and are given below. The space 
group dimensions in all the space groups and the beta angle in space group C2 can 

25 often vary by as much as 10% between individual crystals. The space group and 
dimensions are as follows. C222(l) is the most prevalent space group, followed by 
C2, and then P222(l). C2 is seen only when a compound has been soaked into the 
crystal. 



30 C222(l): a=89.874 b=102.514 c=108.194 alpha=90 beta=90 gamma=90 

C2: a=102.366 b=89.598 c-1 10.438 alpha=90 beta=96.983 gamma=90 
P222(l): a=86.328b=103.061 c=102.846 alpha=90 beta=90 gamma-90 

35 
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Tlie crystals can be cryoprotected for data collection in a cryosolution 
consisting of 12% PEG 6000, 11% MPD, 0.1 M HEPES pH 7.5, 20% glycerol. The 
the cryosolution can include from about 10 to about 14 percent by weight 
polyethylene glycol (PEG 6000), from about 9 to about 13 percent by weight 2- 
5 methyl-2,4-pentanediol (MPD), and from about 18 to about 22 percent by weight 
glycerol. The cryosolution can have a pH of from about 7.3 to about 7.7. 

Example 3 

GSK3-P Construct Crystal Structure Resolution 
In this example, a representative method for resolving the crystal structure of 

10 the GSK3-P protein construct is described. 

The crystal structure of GSK3P was solved using a combination of molecular 
replacement and cross-crystal averaging. A molecular replacement solution for the 
C222(l) crystal form was obtained using a homology model of GSK3p (containing 
residues 53-352) and the program EPMR [CR Kissinger, DK Gehlhaar, and DB 

15 FogeL Rapid automated molecular replacement by evolutionary search. Acta 
Crystallogr D Biol Crystallogr. 1999 Feb;55 (Pt 2):484-91]. The homology model 
was constructed using the phosphorylated form of CDK2 (PDB accession code 1 JST) 
as the template. This initial molecular replacement solution gave a model with an 
R-factor of 53.9%. The model was then processed through several rounds of a 

20 refinement macrocycle. A typical refinement macrocycle consists of 100-200 
roimds of conjugate gradient minimization, simulated annealing with either torsion or 
Cartesian dynamics, and grouped or individual temperature factor calculation. All 
refinement procedures were executed using the program CNX (Molecular 
Simulation, Inc.) This was followed by calculation of new electron density maps and 

25 manual rebuilding of the model based on features within these maps using the 
program O (DATAONO AB). All data from 30A-2.8A in the C222(l) set of data 
was used. Despite numerous macrocycles of refinement, it was not possible to 
improve the model to better than an R-factor of 35% (free-R factor of 0,41). This 
bias towards an incorrect structure was caused by the lack of high-resolution data and 

30 the low ratio of parameters to measurable data in the refinement. 

To alleviate this bias, cross-crystal averaging was used to improve the phases 
provided by the initial molecular replacement solution. In cross-crystal averaging, 
two or more sepai*ate "views" of a structure are obtained from crystals of vastly 
different unit cell dimensions and/or space-groups. The electron densities of these 

35 views are then averaged together to provide more accurate phases for the Fourier 



wo 02/24893 



-189- 



PCT/USOl/29549 



reconstruction of the electron density and an improved view of the actual electron 
density within each of the separate crystals. For the cross-crystal averaging, the 
C222(l) crystal data set and C2 crystal data set (obtained from soaking a GSK3p 
crystal in 1-3 mM of Compound 1 (illustrated below) were used. Two molecular 
5 replacement solutions were obtained for the C2 crystal form usmg the partially 
refined GSK3P model; these solutions represented the two molecules of GSKSp in 
the asymmetric unit of the C2 cell. The real-space transforms between the molecule 
in the C222(l) crystal form and the molecules in the C2 crystal form were then 
calculated. Following this, the phases and figures-of-merit from an ensemble of 10 

10 structures run through the refinement macrocycle in the C222(l) crystal form were 
averaged against the data from the C2 crystal form using the program DMMULTI 
[K. Cowtan. Joint CCP4 and ESF-EACBM Newsletter on Protein Crystallography 
1994; 31, pp 34-38] and all data to 2.7 A, One hundred cycles of averaging, solvent- 
flattening, and histogram matching were performed. Following this, improved 

15 electron density maps were calculated and the model was rebuilt, followed by 
another refinement macrocycle, and another round of cross-crystal averaging. Five 
total averaging/refinement cycles were performed. In cycles 2-5, major errors were 
found in the C222(l) model and corrected, and the rest of the macromolecule was 
built (residues 37-384). The final C222(l) crystal structure consists of the 

20 biologically relevant molecule, 79 waters, and one putative chloride ion, and has an 
R-factor of 22.46% and a free-R factor of 28.22% usuig data from 30A-2.7A. 

Compoxmd 1 : 



O 




Following the completion of the refinement of C222(l) crystal stmcture, the 
resulting model and molecular replacement were then used to determine the position 
and structure of the molecule in the two soaked-compound C2 crystals and the 
30 P222(l) crystal. The current best GSK3P stmctare is from a P222(l) crystal form 
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obtained dtiring a soak with Compoimd 2 (illustrated below) which diffracts to 2.2A. 
This structure consists of two copies of the biologically relevant macromolecule in 
the asymmetric unit, 336 waters, 6 glycerols; this structure has an R-factor of 26.2% 
and free-R factor of 30.8% using data from 30 A-2.2 A. Compound 2 has not 
5 reached high enough occupancy in the active site to be observed. 
Compound 2: 



O 




10 This approach was used to determine the structure due to the limiting nature 

of available data collection facilities at the time. More traditional methods such as 
single/multiple isomorphous replacement or MAD phasing by themselves or in 
conjunction with molecular replacement could have been used equally as well. A 
description of these methods and other crystallographic principles can be found in 

15 B.K. Shoichet and D.E. Bussiere, The role of macromolecular crystallography and 
structure for dmg discovery: advances and caveats. Current Opinion in Drug 
Discovery & Development 2000; 3(4): 408-422. 

While the preferred embodiment of the invention has been illustrated and 
described, it will be appreciated that various changes can be made therein without 

20 departing from the spirit and scope of the invention. 
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The embodiments of the invention in which an exclusive property or privilege 
is claimed are defined as follows: 

1. A method for providing an atomic model of a GSK3 protein, 
comprising: 

(a) providing a computer readable medium having stored thereon atomic 
coordinate/x-ray diffraction data of the GSK3 protein in crystalline form, the data 
sufficient to model the tliree-dimensional structure of the GSK3 protein; 

(b) analyzing the atomic coordinate/x-ray diffraction data from (a) to 
provide data output defining an atomic model of the GSK3 protein; and 

(c) obtaining atomic model output data defining the three-dimensional 
structure of the GSK3 protein. 

2. A computer readable medium having stored thereon atomic model 
data of the GSK3 protein produced by the method of Claim 1 . 

3. A GSK3-P ligand corresponding to the physical model of the atomic 
model of the ligand model produced by the method of Claim 1 . 

4. A method for designing ligands that bind to a GSK3 protein, 
comprising using some or all of the atomic coordinates of the GSK3 construct 
presented in Table 2. 

5. A method for designing ligands that bind to a GSK3 protein, 
comprising: 

(a) providing a purified GSK3 protein; 

(b) crystallizing the purified GSK3 protein to provide a crystallized GSK3 
protein having biological activity; 

(c) resolving the structure of the crystallized GSK3 protein using x-ray 
crystallography to obtain data suitable for three-dimensional structure determination 
of the GSK3 protein; 

(d) applying the data generated fi-om resolving the structure of the 
crystallized GSK3 protein to a computer algorithm to generate a model of the GSK3 
protein suitable for use in designing ligands that will bind to the GSK3 protein active 
site; and 

(e) applying an iterative process whereby molecular structures are applied 
to the computer generated model to identify GSK3 binding ligands. 
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6. The method of Claim 5, wherein the protein comprises the atomic 
coordinates set forth in Table 2. 

7. The method of Claim 5, wherein the protein comprises the amino acid 
sequence set forth in SEQ ID NO: 1 or an active mutant or variant thereof 

8. A GSK binding ligand designed by the method of any one of Claims 

4-7. 

9. A method for identifying a GSK3 mediator by determining the 
binding interactions between a potential mediator and a GSK3 binding site, the 
binding site being defined by at least some of the atomic coordinates set forth in 
Table 2, the method comprising: 

(a) generating a binding cavity defined by the binding site on a computer 

screen; 

(b) generating compounds with their spatial structure; and 

(c) determining whether the compounds bind at the GSK3 binding site. 

10. A method for identifying a compound that mediates GSK3 activity^ 
comprising: 

(a) designing a potential mediator for GSK3 that will form non-covalent 
bonds with amino acids in the GSK3 binding site based on at least some of liie 
atomic structure coordinates set forth in Table 2; 

(b) obtaining the potential mediator; and 

(c) determining whether the potential mediator mediates the activity of 

GSK3. 

11. A method for identifying a compound that mediates GSK3 activity, 
comprising: 

(a) using a three-dimensional structure of GSK3 as defined by tiie atomic 
coordinates set forth in Table 2 to design or select the potential mediator; 

(b) obtaining the potential mediator; and 

(c) contacting the potential mediator with GSK3 to determine whether the 
potential mediator mediates the activity of GSK3. 



12. A computer for producing a three-dimensional representation of a 
molecule or molecular complex, wherein said molecule or molecular complex 
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comprises a binding pocket defined by at least some of the atomic coordinates of 
GSK3 provided in Table 2, or a three-dimensional representation of a homologue of 
the molecule or molecular complex, wherein the computer comprises: 

(a) a machine-readable data storage medium comprising a data storage 
material encoded with machine-readable data, wherein said data comprises the 
atomic coordinates set forth in Table 2; 

(b) a working memory for storing instructions for processing said 
machine-readable data; 

(c) a central-processing unit coupled to said working memory and to said 
machine-readable data storage medium for processing said machine readable data 
into said tliree-dimensional representation; and 

(d) a display coupled to said central-processing unit for displaying said 
three-dimensional representation. 

13. A computer for determining at least a portion of the atomic 
coordinates corresponding to an X-ray diffraction pattern of a molecule or molecular 
complex, wherein said computer comprises: 

(a) a machine-readable data storage medium comprising a data storage 
material encoded with machine-readable data, wherein said data comprises at least a 
portion of the atomic coordinates set forth in Table 2; 

(b) a machine-readable data storage medium comprising a data storage 
material encoded with machine-readable data, wherein said data comprises an X-ray 
diffraction pattern of said molecule or molecular complex; 

(c) a working memory for storing instructions for processing said 
machine-readable data of (a) and (b); 

(d) a central-processing unit coupled to said working memory and to said 
machine-readable data storage medium of (a) and (b) for performing a Fourier 
transform of the machine readable data of (a) and for processing said machine 
readable data of (b) into structure coordinates; and 

(e) a display coupled to said central-processing unit for displaying said 
structure coordinates of said molecule or molecular complex. 

14. A method for crystallizing a human glycogen synthase kinase 3 
(GSK3) protein, comprising: 

(a) providing a purified GSK3 protein; and 
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(b) crystallizing the purified GSK3 protein to provide a crystallized GSK3 
protein having biological activity, wherein the protein is crystallized from a solution 
comprising from about 10 to about 14 percent by weight polyethylene glycol, firom 
about 9 to about 13 percent by weight 2-methyl-2,4-pentanedioi, and firom about 18 
to about 22 percent by weight glycerol, and wherein the crystallized GSK3 protein is 
resolvable using x-ray crystallography to obtain x-ray pattems suitable for three- 
dimensional structure determination of the GSK3 protein. 

15. The method of Claim 14, wherein crystallizing the GSK3 protein 
comprises crystallizing by a hanging drop vapor diffusion method. 

16. The method of Claim 14, wherein the protein comprises the atomic 
coordinates set forth in Table 2. 

17. The method of Claim 14, wherein the protein comprises the amino 
acid sequence set forth in SEQ ID NO: 1 or an active mutant or variant thereof. 

18. A crystallized GSK3 protein provided by the method of Claim 14. 
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